Hierarchical Bayesian Domain Adaptation
نویسندگان
چکیده
Multi-task learning is the problem of maximizing the performance of a system across a number of related tasks. When applied to multiple domains for the same task, it is similar to domain adaptation, but symmetric, rather than limited to improving performance on a target domain. We present a more principled, better performing model for this problem, based on the use of a hierarchical Bayesian prior. Each domain has its own domain-specific parameter for each feature but, rather than a constant prior over these parameters, the model instead links them via a hierarchical Bayesian global prior. This prior encourages the features to have similar weights across domains, unless there is good evidence to the contrary. We show that the method of (Daumé III, 2007), which was presented as a simple “preprocessing step,” is actually equivalent, except our representation explicitly separates hyperparameters which were tied in his work. We demonstrate that allowing different values for these hyperparameters significantly improves performance over both a strong baseline and (Daumé III, 2007) within both a conditional random field sequence model for named entity recognition and a discriminatively trained dependency parser.
منابع مشابه
A Hierarchical Nonparametric Bayesian Approach to Statistical Language Model Domain Adaptation
In this paper we present a doubly hierarchical Pitman-Yor process language model. Its bottom layer of hierarchy consists of multiple hierarchical Pitman-Yor process language models, one each for some number of domains. The novel top layer of hierarchy consists of a mechanism to couple together multiple language models such that they share statistical strength. Intuitively this sharing results i...
متن کاملModel Adaptation with Bayesian Hierarchical Modeling for Context-Aware Recommendation
Model adaptation is a process of modifying a model trained with a large amount of training data from the source domain to adapt a speci c similar target domain by using a small amount of adaptation data regarding the target domain. Bayesian hierarchical modeling is well known as a general tool for model adaptation and multi-task learning, and widely used in various areas such as marketing, ecol...
متن کاملBayesian Multitask Learning with Latent Hierarchies
We learn multiple hypotheses for related tasks under a latent hierarchical relationship between tasks. We exploit the intuition that for domain adaptation, we wish to share classifier structure, but for multitask learning, we wish to share covariance structure. Our hierarchical model is seen to subsume several previously proposed multitask learning models and performs well on three distinct rea...
متن کاملA Hierarchical Bayesian Approach for Semi-supervised Discriminative Language Modeling
Discriminative language modeling provides a mechanism for differentiating between competing word hypotheses, which are usually ignored in traditional maximum likelihood estimation of N-gram language models. Discriminative language modeling usually requires manual transcription which can be costly and slow to obtain. On the other hand, there are vast amount of untranscribed speech data on which ...
متن کاملHierarchical Bayesian space-time interpolation versus spatio-temporal BME approach
Abstract. The restrictions of the analysis of natural processes which are observed at any point in space or time to a purely spatial or purely temporal domain may cause loss of information and larger prediction errors. Moreover, the arbitrary combinations of purely spatial and purely temporal models may not yield valid models for the space-time domain. For such processes the variation can be ch...
متن کامل